PERTURBATIONS OF ROOTS UNDER 
LINEAR TRANSFORMATIONS OF POLYNOMIALS 
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Abstract. Let Vn be the complex vector space of all polynomials of 
degree at most n. We give several characterizations of the linear oper- 
ators T : Vn — > Vn for which there exists a constant C > such that 
for all nonconstant / G V n there exist a root u of / and a root v of Tf 
with \u — v\ < C. We prove that such perturbations leave the degree un- 
changed and, for a suitable pairing of the roots of / and Tf, the roots are 
never displaced by more than a uniform constant independent on /. We 
show that such "good" operators T axe exactly the invertible elements 
of the commutative algebra generated by the differentiation operator. 
We provide upper bounds in terms of T for the relevant constants. 



1. Introduction 

Let n be a positive integer, and denote by V n the (n + l)-dimensional 
complex vector space of all polynomials of degree at most n. Let T be a 
linear operator from V n to V n . In we proved that for each non-constant 
polynomial / G V n the polynomials / and Tf have at least one common 
root if and only if T is a non-zero constant multiple of the identity on V n . 
In other words, if T is not a multiple of the identity, then there exists a 
polynomial / G V n such that / and Tf do not share any roots. A natural 
question to ask is: How far apart are the roots of Tf from the roots of /? 

This requires that we introduce a measure of distance between finite sub- 
sets of the complex plane C. In Section |H] we introduce four such distances, 
among which are two common ones: du, the Hausdorff distance and cIf, 
the Frechet distance. The main result of this article is the characterization 
of the set of those linear operators T : V n — > V n f° r which there exists a 
constant C > such that for all / G V n , the distance between the roots 
of polynomials / and Tf is at most C. Here the distance can be any of 
the four distances that we introduce, which implies that this set of "good" 
operators will turn out not to depend on the distance used. 

A simple example of a "bad" operator is the operator R : V n V n which 
changes the sign of the independent variable, defined by 

(Rf){z) := f(-z), zeC, f G V n . 
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If the roots of / are large positive numbers, then the roots of Rf are negative 
numbers with large moduli, making any of the distances that we consider 
as large as we want. 

To illustrate our result let a be a complex number and consider the lin- 
ear operator S(a) : V n — > V n corresponding to the additive shift of the 
independent variable. It is defined by 

(1.1) (S(a)f)(z):=f(z + a), z £ C, feV n . 

It will be quite clear that (with respect to any of the four distances) the 
distance between the roots of / and the roots of S(a)f will be at most |a| 
(see Proposition 19.41) . The Taylor formula at z implies that the operator 
S(a) can be expressed as 

S{a) = I+^D + -D* + ... + — D\ 
1! 2! n! 

where D : V n — > V n is the operator of differentiation with respect to the 

complex variable. This example hints at the main result of this article stated 

in Theorems 17. 31 and 1 1 1 . 1 1 We paraphrase it below. 

Let T G C(V n ), T / 0, and let Z(f) denote the set of the roots of a 

non-constant / G V n . The following statements are equivalent. 

(i) There exists a constant C > 0, which depends on the distance d, such 
that d(Z(f), Z(T/)) < C for each non-constant / £ V n . 

(ii) There exist cto, ai, . . . , a n G C, ao / 0, such that 

(1.2) T = a I + a 1 D + a 2 D 2 + --- + a n D n . 

In (0) the symbol d can be replaced with any of the distances d m ,dh,dH, dp 
from Section |HJ Thus (0) really stands for four equivalent statements. More- 
over, for T described in (juj) and for each of the distances, in Theorem I1U. 41 
we give an estimate for the maximum possible distance between the roots 
of / and the roots of Tf in terms of T. 

Surprisingly, we found only one article, ^1], which considers the rela- 
tionship between (p and (jnj) as stated above. In ^1] an entirely different 
method was used to prove that (jnj) implies (0) with the distance d = dp. 
The converse was not considered in |14| . Also, no specific estimate for C is 
given there, which is in part due to the use of "soft" theorems from complex 
function theory. 

It is not surprising, though, that the location of the roots of Tf in relation 
to the roots of / for T as in (jnj) has been extensively researched, see |12l 
Sections 5.3 and 5.4]. In fact, the implication (jnl)=^(0), with the distance 
d = dh, is a consequence of Grace's theorem, [121 Theorem 5.3.1]. For 
completeness we include the details in Sections and El below. Furthermore, 
|12| Corollary 5.4.1] is fundamental for the proof that implies (0) with 
the distance d = dp. 

The article is organized in twelve short sections; the first section being 
this introduction. In Section [2] we recall Grace's theorem and one of its 
consequences. This consequence is of interest to us since in Section El we 
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restate it in terms of operators on V n . To this end, in Section El we study 
the algebra of operators given by (|1.2jl and a connection between this algebra 
and V n is explored in Section!!] Since the proofs in SectionsEJandElare short 
and interesting we have not omitted them. In Section El we present a version 
of Grace's theorem for linear operators on V n . This theorem and a result 
from Section El are the main tools in Sectional in which the first version of 
our main result is formulated as Theorem 17.31 This is where we prove the 
equivalence of (0) and (jHl) stated above. In addition, Theorem 17.31 among 
several equivalent statements, contains the converse of Grace's theorem for 
linear operators. In Theorem 17.31 we do not use the concepts of distances 
from Section El We wanted to keep the first part of the article, Sections E] 
through[71 independent of these concepts. 

However, the second part depends heavily on the four distances from 
Section El Why these four distances? The distance d m is the simplest 
("the two closest points distance"), and dp are implicitly already present 
in theorems about roots of polynomials, and the Hausdorff distance dn is 
probably the simplest distance which is a metric. In Sections I§1 and ITU1 for 
each distance, we give exact calculations and estimates for the maximum 
possible distance between the roots of / and the roots of Tf in terms of 
T. Finally, in Section ^2 we present the main theorem, Theorem lll.il We 
conclude with several examples in Section IT21 

We now introduce the basic notation. By deg(/) we denote the degree of 
a polynomial / G V n . For a non-zero polynomial / G Vn, Z{f) C C will 
denote the multiset of all the roots of /, that is, each root of / appears in 
Z(f) as many times as its multiplicity as a root of /. Thus Z{f) has exactly 
deg(/) elements, and these are not necessarily distinct. The distinction 
between sets and multisets is essential only when we consider the Frechet 
distance dp. In all other cases Z{f) can be considered simply as the set of 
roots of /. For completeness we set Z(0) = C. 

By C{V n ) we denote the set of all linear operators from V n to V n . We 
shall simply refer to elements in C(V n ) as operators. Whenever we need 
a basis for V n we shall use the basis {(f>o, (pi, . . . , </> n } (in this listed order) 
where <pk{ z ) '■= z k /k\, k = 0, 1, . . . , n. 

By 0(w, r) we denote the closed disk in C centered atwSC with radius 
r > 0. The letter z always stands for a complex number. For A,BcC 
we define A + B := {u + v : u £ A, wG B} and —A := {— u : u G A}. By 
conv(^4) we denote the convex hull of A. Thus, for / G V n , conv(Z(/)J is 
the convex hull of the roots of /. 

Finally, we thought it would be in reader's interest to try to have as many 
references as possible pointing to a single source, and the recent monograph 
by Rahman and Schmeisser ^21 is perfectly adapted to the task. Another 
standard reference in this field is [7] . 
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2. Grace's Theorem 

We begin with the definition of the *-product of polynomials which ap- 
pears in page 375] (see also pages 148 and 178 in [T2]). 

Definition 2.1. Let f,g £ V n be polynomials such that deg/ = degg = 
m > 0. Let afc = /^(0) and bk = g^ k \0) for k = 0, ... , re, be the coordinates 
of / and g with respect to the basis {4>o, <f>i, ■ ■ ■ , <ft n } ofV n . Set 

m m m /m—k \ ^ 

fc=0 k=0 k=0 \j=0 ) 

The reader can easily verify (or see |13j ) that the three sums that appear 
in the definition are equal for all z G C 

Definition 12. II requires that the polynomials which are being *-multiplied 
have the same degree. The definition depends on the common degree and 
the *-product is a polynomial of the same degree. It is clear that * is 
commutative. 

The next definition is equivalent to the standard one, see |12[ Defini- 
tion 3.3.1]. As before, (Rf)(z) = f(-z). 

Definition 2.2. Two polynomials / and g with equal positive degrees are 
apolar if (/ * (Rg)) (0) = 0. 

The symmetry of the apolarity relation follows from the straightforward 
equality g * (Rf) = (-l) m f * (Rg). 

The most important result about apolar polynomials is Grace's theorem, 
see [HJ Theorem 3.4.1]. 

Theorem 2.3 (Grace). Let f and g be apolar polynomials. If £1 is a circular 
domain and Z(g) C Q, then € Z(f) — £1. 

With S(a) as defined in (jl.lj) we clearly have Z(S(a)p) = {—a} + Z(p) 
for any polynomial p. Let now / and g be polynomials with the same 
positive degree. It is easy to verify that (S(a)f) * (Rg) = S(a)(f * (Rg))- 
Combining the last two equalities we conclude that S(a)f and g are apolar 
if and only if a £ Z(f * (Rg)). Applying Grace's theorem to S(a)f and g 
yields that Z(f * (Rg)) C Z(f) - O whenever SI is a circular domain and 
Z(g) C IX Since Z(Rg) = —Z(g) this leads to the following theorem, see 
PI Theorem 5.3.1]. 

Theorem 2.4. Let f and g be polynomials with the same positive degree. 
If £1 is a circular domain and Z(g) C £1, then Z(f * g) C Z(f) + 0. 

For a fixed g G V n with degree n, the mapping ►/*<?,/ € V n \P n -i, 
is a restriction of a linear combination of derivatives. We shall explore the 
relationship between the *-product and operators on V n further. For that 
purpose we first study linear combinations of derivatives. 
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3. The commutative algebra V(V n ) 

The following definition introduces our main object of study. 

Definition 3.1. Let D : V n — ► V n be the operator of differentiation on V n . 
By V(V n ) we denote the linear span in C(V n ) of the operators /,£),..., D n . 
If ao, ...,£% G C and 

(3.1) T = a / + aiD + --- + a n D n eP(P„), 

then we write T = T(ao, • • • , a n )- 

To get familiar with the operators in D(V n ) we first obtain their matrix 
representation with respect to the basis of V n defined as 

(3.2) {0o,0i,..., 4>n), (t> k {z) := z k /k\, k = 0,...,n. 
For each m G {0, . . . , n} we clearly have 

(3.3) D k (j) m = (j) m -k, < k < m, and D k (j) m = 0, m < k < n. 
Consequently, for T given by (|3.1[) . we have 

(3.4) T0 m = a m o + a m _i0i H h a\<f> m -\ + a o m , m = 0, ...,n, 

and therefore a m = (T0 m J (0), m = 0, . . . , n. Equalities Q3.4JI imply that the 
matrix of T with respect to the basis in (|3,2j) of V n is the following upper 
triangular Toeplitz matrix 



a ai a 2 
ao ai 
a 



On— 1 
On-2 

On-3 On-2 



a 




ai 



Additional basic information about V(T n ) is provided in the next three 
statements. 

Proposition 3.2. Let T G C{V n ). Then T G P(P„) if and only if T 
commutes with D. 

Proof. All elements of 2?('P n ) clearly commute with D. To prove the con- 
verse, set C n = {T G C(V n ) '■ TD = DT}. Clearly C n is a subspace of C{V n ) 
and V{V n ) C C n . Let T G C n . By T0 fc = TD n ' k (j) n = D n ~ k Tcf) n , 

for k = 0, ...,n. Hence T G C n is uniquely determined by T<f> n G V n . 
Consequently, the evaluation operator T i— > T0 n , T G C n , is an injection. 
Therefore dimC n < dim'Pn = n + 1. Since I,D,...,D n , are linearly in- 
dependent elements of C n , it follows that dim(C n ) = n + 1. Consequently 
C n =X>(P n ). □ 
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Corollary 3.3. T)(V n ) is a maximal commutative subalgebra of C{V n ). 

The proposition below can be proved in different ways. See, for example, 
the last paragraph in Section We include this proof since its method is 
also used in Section ITU1 

Proposition 3.4. Let T G ViVn). The operator T is invertible if and only 
if T(pQ 0. If T is invertible, then T _1 G V(V n ). 

Proof. The "only if part of the first statement is obvious. To prove the "if 
part assume that TcpQ ^ 0. First note that since D G 2?('P n ) is nilpotent, 
each operator / -7O 6 D('P n ) is invertible and 

(3.5) (/ - = I + 7 D + • • • + 7 n D n G V{V n ). 

Now let T G T>(V n ) be given by (|3.1|) and assume Tcf)Q / 0. Then, by 
(331, «o = T(p / 0. Following [El Section 5.4, p. 151], let 71, . . . ,j n be 
the roots of 

a z n + ai z n ~ H h a n _i 2 + a n 

counted according to their multiplicities. Then clearly, 

n n 

a -\ \-a n z n = a z" (z^ 1 - jj) = a (1 - Jjz) , 

and therefore, 

n 

(3.6) r = T(a 0> ...,o n ) = aon( J -7^)- 

i=i 

As a product of invertible operators, T is invertible. Since the inverse of each 
of its invertible factors is in 

V(P n ), Corollary O implies T _1 G V(V n ). □ 

4. The algebra V(V n ) and the vector space P„ 
In this section we explore the relationship between DiVn) and V n . 
Definition 4.1. Define w : V(V n ) -> 7> n by 

tJ7(T) := T0 n , T G V(V n ). 
If T = T(ao, . . . , a n ), then, by (|3.4() with m = n, 

(4.1) ror(r) = a </>n + «1 0n-l H h «n-l 01 + «n 00- 

Results from Section 01 and (|4.1|) yield the following proposition. 

Proposition 4.2. T/ie operator w is a linear bijection. The image under 
w of the set of all invertible operators in 2?("P n ) is the set of all polynomials 
of degree n. 

Since V{Vn) is a commutative algebra, it is natural to use w to equip V n 
with an algebra structure. We do that next. 
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Definition 4.3. The ^-product is defined on V n by 

(4.2) f*g:=w{w- l {f)m~\g)), f,g€V n . 

The properties of X>(7 3 n ) an d w yield the following corollary. 

Corollary 4.4. (a) The vector space V n equipped with the -k-product is a 
commutative algebra with unit 4> n . 

(b) The operator w : T){V n ) — > "P n is an algebra isomorphism. 

(c) TTie -k-invertible polynomials are exactly the polynomials of degree n. 

(d) For each m G {0, 1, . . . , n — 1}, t/ie set 

(4.3) {/£P„ :deg/ = n, /«(0) = 0, k = 0, . . . , n - m - l} 
is a -k-subgroup of the -k-group V n \Pn-i- 

Now we are ready to establish the connection with the operators on V n . 
Proposition 4.5. Let T G X>(7> n ). T/ien, 

Tf = f* (tu(T)) = (tJ7(T)) * /, / G 7>„, 
(Tf)*g = f*(Tg)=T(f*g), f,g€P n . 

Proof. Let T G D(P n ) and / G TV If / = ro(F) = F^ n , then tt7 _1 (/) = V, 
and therefore (ro _1 (/)) (0 n ) = /. Using successively the commutativity of *, 
the fact that w is an algebra isomorphism, the definition of w, and the last 
equality, we calculate 

f*(w(T)) =m(T)*f = m(Tw- 1 (f)) = T[(w- 1 (f))(<M) =Tf. 
Now the second claim follows from the associativity of the ^-product. □ 

The definition in 1)4.2)) is convenient since it emphasizes the connection 
between "D(V n ) and TV However, the formula for the ^-product in terms 
of the coordinates with respect to {(/>o, . . . , 4> n } is also useful. Let f,g G V n 
and a k = / (fc) (0) and b k = # (fc) (0) for k = 0, . . . , n. Now we first use 1)4. ljl to 
express m~ 1 (f) and w~ l {g) in terms of the coordinates of / and g, then we 
calculate the composition w~ 1 (f)w~ 1 (g), and again, we use 1)4.1)) to get 

n / n—k \ k 

(4-4) (f*g)(z) = J2 l^ak+jbn-j |r- 

fc=0 V j=0 / 

If deg/ = degg = n, a comparison of 1)4.4)1 and Definition 12.11 with m = n, 
yields that f * g = f * g. More generally, the following proposition holds. 

Proposition 4.6. Ze£ f,g€ TV deg / = m and degg = n. T/ien 
(4.5) f*(D n - m g)=f*g. 

For each m G {1, . . . ,n} the set V m \V m -\ with the ^-product is a commu- 
tative group. The mapping D n ~ m restricted to the set ()4.3|) is an isomor- 
phism between the commutative group 1)4.3)1 equipped with the ^-product and 
T > m\Pm-l with the ^-product. 
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Proof. Set afc = /^(0) and bk = g^ h \0) for k = 0, . . . ,n. Now regroup the 
terms in 1)4.4)1 and use a m +i = • • • = a n = to get a proof of 1)4.5)1 : 

71 771 

i=o i=o 

The mapping D n ~ m restricted to the set 1)4 .3)1 is clearly a bijection be- 
tween that set and V m \V m -i. Let now h and g be polynomials in the 
set 1)4.3)1 . Using 1)4.5)) and Proposition 14.51 we calculate 

(D n ~ m h) * (D n ~ m g) = (D n - m h) * 5 = £> n - m (/t * 0). 

This proves the last claim of the proposition. □ 

The fact that V m \V m -i with the *-product is a commutative group was 
proved in |13| . Moreover, in ^3] the reader can find a nice formula for the 
inverses. 

5. Grace's theorem for linear operators 

The next theorem is a restatement of Theorem 12.41 in terms of operators 
on V n . The role of g in Theorem 12.41 is now played by an invertible operator 
T G V(V n ). The union of the sets Z(T<f>k), k = 1, . . . , re, plays the role of 

z(g). 

Theorem 5.1. Let T G C(V n ). If T G T>{P n ) and T is invertible, then 
Z(Tf) C Z(f) + £2 for all f G "P«\{0} and for all circular domains Cl such 
that Z(T4>k) C £1, k = 1, . . . , re. 

Proof. Let T be an invertible operator in T>(V n )- For a constant non-zero / 
the theorem is obvious. Let / be a non-constant polynomial in Vn- Set m = 
deg/. Then, by 1)3.4)1 and Proposition 13.4) deg(T/) = m and deg(tu(T)) = 
deg(T</> n ) = re. Propositions 14.51 and 14.61 yield 

(5.1) Tf = f* [T<j> n ) = f * {D n ~ m T<p n ) . 

Let be a circular domain such that Z(T$k) C ^ for all A; = 1, . . . , re. Since 
by Proposition EH and if3~3|) . D n ~ m T(p n = TD n ~ m (p n = T<j> m , the theorem 
follows from 1)5.1)1 and Theorem 12.41 □ 

Corollary 5.2. Xe£ T 6e an invertible operator in "D(V n ) and let £1 be a 

convex circular domain such that Z(T(p n ) C 0. Then Z(Tf) C + f2 

/ora///GP n \{0}. 

Proo/. By Gauss-Lucas' theorem, see Q2 Theorem 2.1.1], Z(D n - k T(j) n ) C 
conv{Z(T(f) n )j, k = l,...,re. Hence, for a convex f2, Z{T(p n ) C implies 
Z(T<pk) C O, = 1, . . . , re, and Theorem 15 . 1 1 applies . □ 

Theorem 15.11 is a motivation for the following definition. 

Definition 5.3. An operator T G CiVn) will be called Grace operator if 
there exists a finite set A C C such that Z(Tf) C Z(/)+fi for all / G V n \{0} 
and for all circular domains £1 such that A C £1. 
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The question whether each Grace operator is an invertible operator in 
T)(V n ) will be answered by Theorem 17.31 

6. The first step towards the main result 

Lemma 6.1. Let T £ £{T n ), 0. Assume that there exists a constant 
C > such that for each non-constant polynomial f £ V n there exist u £ 
Z{f) and v £ Z(Tf) such that \u — v\ < C. Then the matrix of T with 
respect to the basis {(j>o, . . . , <pn\ of V n is upper triangular and the main 
diagonal entries are all equal to the same non-zero constant T<pQ. 

Remark 6.2. Note that the hypothesis of Lemma l6.1l implies that Z(Tf) ^ 
for all non-constant / £ V n . Further, the conclusion of the lemma implies 
that T maps constant polynomials into constants. 

Proof of Lemma Vf\.l\ Let m £ {1, ... ,n} and t > be arbitrary. Consider 
the polynomials 4> m — (t m /m\)(j)Q and T(0 m — (t m /m!)^o) • By hypothesis 
these two polynomials have roots which are at most C apart. Thus, for 
each t > there exists an m-th root of unity 6(t) such that the polynomial 
T(f> m — (t m /m\)T(f>o has a root w(t) in the disc H)(t 9(t),C). If we assume 
that T(/)q = 0, then the last statement would imply T(j) m = 0. Since m £ 
{1, . . . , n} is arbitrary, this would yield T = 0. But 0; hence, Tcfro ^ 
holds. 

Set v(t) = w(t) -t6(t), t > 0. Since v(t) £ D(0,C) and 0(t) £ O(0,l), 
for each k = 0, . . . , n, 

(6.1) (T^ite^ + vit)) =0(t dc ^ T ^), t^+oo, 

and deg(T^fc) is the smallest power of t for which (|6.1|) holds. The special 
case of (|6.1j) . with k = 0, implies, for each m = 0, . . . , n, 

(6.2) (t m /m\)(Tfo)(t9(t)+v(t)) =0(t m+de ^ T ^), t^+oo, 

and m + deg(T^o) is the smallest power of t for which (|6.2|) holds. Recall 
that by the definition of 9(t) and v(t) we have 

(6.3) (T<p rn )(t6{t) + v{t)) = (t m /m\)(T<i) Q ){te{t) + v{t)), t > 0. 
This, (jnU and (JSHJ imply 

deg(T(f> m ) = m + deg(T^ ), m = 0, 1,... ,n. 

Since deg(T^ ra ) < n, the last equality with m = n implies deg(Tc^o) = 0; 
that is, T(J)q is constant. Consequently, 

(6.4) deg(T0 m ) = m, m = 0, l,...,n. 

Hence, the matrix of T with respect to the basis {</>o, . . . , <ft n } of V n is upper 
triangular. 

The main diagonal entries of this matrix are equal to the limits 
,. (T(/> m )(z) 

lim / / \ ; m = 0, 1, . . . , n, 
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existence of which is a consequence of (|6.4[) . Since v(t) £ D(0, C) and 
6{t) £ B(0, 1), (|6.<3|) implies that these limits all equal to the constant Tfio. 
The lemma is proved. □ 

The following theorem is the first step towards a complete answer to the 
question posed in the Introduction. 

Theorem 6.3. Let T £ C(V n ), T / 0. Assume that there exists a constant 
C > such that for each non-constant f £ V n there exist u £ Z{f) and 
v £ Z(Tf) such that \u — v\ < C. Then 

(6.5) T = a I + ai D + --- + a n D n , 
where 

(6.6) a = (T0 Q )(O) ± and a k = (T0 fe )(O), k = l,...,n. 

Proof. Let T be as in the hypothesis. Define the coefficients a k , k = 0, . . . , n, 
by (|6.6|) . Next we shall prove ()6.5|) . 

From Lemma 16. II we know that the matrix of T with respect to the basis 
{<fio, . . . ,4> n } of V n is upper triangular and the main diagonal entries are all 
equal to the same non-zero number ao := (T(fro)(0) (and thus the statement 
ao 7^ in ()6.6|) is justified). This implies that T is invertible and 

(6.7) T0 O = ((T0 o )(O))0o = a o 0o. 

(Remember that </>o(-2) := 1 for all z G C). 

Since {<fio, . . . , <fi n } is a basis for V n , (|6,5|) is equivalent to 

(6.8) T(f> m = a (f) m + ai 4> m -\ H h a m _i 0i + a m (f> , m = 0,...,n. 

We prove ()6.8|) by complete induction with respect to m. By ()6.7|) equality 
(|6,8|) holds for m = 0. Let k £ {1, . . . , n} and assume that (|6,8|) is true for 
m = 0, ...,k — 1. We need to prove that (|6.8|) holds for m = k. By 
Lemma 16. 11 Tc^fc is a polynomial of degree k: 

k 

(T4> k ){z) = ^bk-j^jiz) 

where bj £ C, j = 0, . . . , /c. The rest of the proof is devoted to calculating 
the coefficients bj, j = 0, . . . , k. 

Let w £ Che arbitrary and consider the polynomial Pk(z) := (pk(z — w). 
Using the binomial expansion of (z — w) k and the induction hypothesis, for 
all z £ C, we get the identity 

k 

(T Pk )(z) = ^T±(-wy{T<j )k _ J )(z) 
j=o J - 

k ^ k-j 

= (T4> k )(z) + —(.-wY ^2 ai4> k -j-i(z) 
j=i J ' 1=0 



ai 
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b, 
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b k + E h i <t>k-i + E tS- w Y E 
1=0 
fc-i 

bk + Y, 



(k-j-iy. 



yk-j-l 



(k-iy. 

+ E "' 



1 2 fc-i 



i=0 

fc-i 



z=o 
fc-1 



6 * + E 



1=0 



k-l 
+ 



fc-i 



(fc-i)! 
(fc-i)! ' (fc-i)! 



a/ 



fc-i 

E 



"'V'^k-J 



fc - i 



W\3 

Z 



By hypothesis, for each u £ C there exists u(w;) G Z(Tpk) such that 
|«j — u(u;)| < C. Put v(itf) = u(u;) — it; and note that < C for all 

w £ C. The substitution z = ii(u>) = w + u(io) in the last long displayed 
identity yields 



fc-i 



which simplifies to 

fc-i ^ 
(6.9) ^fc + Eo- 



h ~ l /fc — A 

><+<.<E( , ) 



-u; 



to + v(w) 



(w + v(w)) k 1 



0, 



1=0 



using 



k-l 

E 

3=1 



(fc-i)! 



fc-i 
J 



fej + Oj 1 



Til + u(w) 



fc-i 



(w; + v(w)*f 1 = 0, 



u> + u(w) 
Regrouping terms in (|6,9j) yields 



io + v(w) 



k-l 



1. 



fc-1 



^ (fc-/)! 
2=0 v ; 



fc-i 

v(w) = 0. 



=0 



(fc-i) 



Since |f(w>)| < C for all w G C the last sum in (|dl()j) is a bounded function 
of w. Therefore (jtj. 1U|) implies 



fc-i 



^7 ( W + ' = 0(1); \w\ — > +OO. 



«=0 



(fc-i)! 



Again, since |v(w)| < C for all uifC, the last displayed relation yields 



6,-o, = 0, i = 0,1,..., fc-1. 



Since clearly b^ = (T^)(0) = a^, we have proved that Q6.8|) holds for 
m = k. By induction, (|6.8|) holds for all m = 0, 1, . . . , n, and the theorem is 
proved. □ 
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7. The first version of the main theorem 
Let / be a non-constant polynomial. The number 
e[f] : = max{|u| : u G Z(f)} 
is called the root radius of /. 

Proposition 7.1. Let T be an in invertible operator in Then for 

every non-constant f £ V n and for each v £ Z(Tf) there exists u £ Z(f) 
such that \v — u\ < g[T(f) n ]. 

Proof. Since the proposition is trivial for a non-zero constant multiple of the 
identity operator, we assume that T / a$I. Then, by (j3.4|) . g[T(j) n ] > 0. 
Let / G V n be a non-constant polynomial. Since O(0, g[T(p n ]) is a convex 
circular domain and Z(T(j) n ) C D(0, g[T(j) n fj , Corollary 15.21 yields 

Z(Tf)cZ{f) + B[0,g[T<M). 

Thus, for each non-constant / G V n , for every t; G Z(Tf) there exists 
u G Z(f) such that |« — v\ < g[T(p n ]. □ 

Remark 7.2. The conclusion of Proposition 17. II can also be expressed as 

Z(Tf)c [J D(it, e [T^ n ]). 

The following theorem is the main result of this section. 

Theorem 7.3. Let T G C(V n ), The following statements are equiv- 

alent. 

(a) T is an invertible operator in D(V n )- 

(b) T is a Grace operator. 

(c) There exists a constant C\ > such that for each non-constant f G V n 
and for each w G Z(Tf) there exists v G Z(f) such that \w — v\ <C\. 

(d) There exists a constant Ci > such that for each non- constant f G V n 
there exist u G Z{f) and v G Z(Tf) such that \v — u\ <Ci- 

(e) T is invertible and it commutes with the differentiation operator D. 

Proof. With A = \J{Z(T(f) k ) : k = 1, . . . , n}, the implication (jaj)^© fol- 
lows from Theorem 15. II 

The following short proof of (JbJ)=^(Jc)) is similar to the proof of Proposi- 
tion Assume (jbj) and let A C C be a finite set from Definition 15.31 Let 
Ci > be such that A C B(0,Ci). Let / G V n \V§. Since O(0, C x ) is a 
circular domain, by Definition 15 . 31 we have Z(T /) C Z(/)+B(0, Ci). Hence, 
for each w G Z(Tf) there exists u G Z{f) such that v — u £ O(0, Ci). This 
proves (jsj). 

The implication (jcj)=^(jdj) is obvious and (JdJ^lJaJ was proved in Theo- 
rem !6.3l Since (0)<^(|£j) is an immediate consequence of Proposition E21 the 
theorem is proved. □ 



PERTURBATIONS OF ROOTS OF POLYNOMIALS 



13 



8. Distances 



Recall the following standard definition: 

Definition 8.1. A function d : X x X —* [0, +oo) is a metric on a nonempty 
set X, if for all x,y, z £ X we have 

(a) d(x, x) = 0; 

(b) d(x,y) = implies that x = y; 

(c) d(x,y) = d(y,x); 

(d) d(x, z) < d(x, y) + d(y, z). 

The problem of measuring the distance between two finite sets of points 
has been considered in several seemingly unrelated areas of research. For a 
recent account and references see (Hj- 

The best known metric on the family of finite nonempty subsets of C is 
the Hausdorff metric defined as 



where A and B are nonempty finite subsets of C. That the function dn 
is really a metric on the family of finite nonempty subsets of C is a simple 
exercise. 

To connect the Hausdorff metric to Theorem 17. 31 we introduce two related 
functions. We informally call these functions distances since they indicate 
the location of points of one set in relation to the other. For two nonempty 
finite subsets A and B of C define 



Clearly neither of the functions d m and dh is a metric. The function d m 
satisfies only (jaj and (jcj) and dh satisfies only (Jaj) and (0J in Definition 18.11 
The function dh is sometimes called asymmetric Hausdorff distance. Both 
distances d m and dh appear implicitly in Theorem 17.31 

The definitions of d m and dh can be extended to include the empty set and 
the entire complex plane, which correspond to Z(f) for / G T'o\{0} and Z(0). 
For both d = d m and d = d hl we set d(0, 0) = and d(A, 0) = d(0, A) = +oo 
whenever A ^ 0. For either A = C or B = C the original definitions make 
sense, giving the value 0. It is in this extended sense that d m ,dh and tiff- 
will be used in the rest of the article. 

Another well known metric is the Frechet metric (see (Hj where a similar 
definition was first introduced, or 4, Chapter 6] where an analogous metric 
is defined for curves) . Let m be a positive integer and put M = { 1 , . . . , to} . 
By n m we denote the set of all permutations of M. For two functions 




d m (A, B) := min{ \x — y\ : x G A, y G 5} 
dh(A, B) := max {d m (A,{y}) : y£B}. 



Now, the Hausdorff metric can be expressed as 



d H (A,B) = max{d h (A,B),d h (B,A)}. 
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u, v : M — ► C we define 

d F (u,v) := min max \u(k) — v(o~(k))\ . 

o-en m fceM 1 1 

The function d F is not a metric on C m since it does not satisfy (jbj) in 
Definition 18.11 But dp is a metric on the factor set C m /~, where u ~ v 44> 
dp(u,v) = 0. The elements of the factor set C m /~ can be identified with 
"unordered" m-tuples, that is with multisets of m complex numbers in which 
a same element can appear more than once. This concept fits well with the 
sets of roots of polynomials where roots can occur with multiplicities. In 
this context the Frechet metric is defined for two multisets of m complex 
numbers U = {u±, . . . , u m } and V = {v\, . . . , v m } by 

d F (U,V) := min maxl-u^ — v a i^ . 

o-en m fceM 1 Wl 

In some ways this distance is a natural distance when perturbation of the 
roots of polynomials are studied, see ^Ql Theorem, p. 276], (Hj and [2]. For 
a simple proof that dp is a metric see j2]. 

We are interested in the question of what happens (in a quantitative sense) 
to the roots of polynomials under linear operators on V n . The following 
numbers give a "one number summary" answer to this question for T G 
C(P n ): 

K m (T) := su V {d m (Z(f), Z(Tf)) :/£?„}, 
K h (T) := sup{d h (Z(/), Z(Tf)) :feV n }, 
K H {T) := S u V {d H {Z(f),Z(Tf)) :/£?„}. 

The power of these definitions is in the fact that the statements from 
theorems in Sections |H1 and [3 can now be formulated in a more compact 
way. For example the hypothesis of Theorem 1(1 3\ is: K m (T) < +oo and the 
conclusion of Proposition I7T1 is: Kh(T) < g[T<f> n ]. 

Since the Frechet metric is defined only for multisets with the same num- 
ber of elements, we can define Kp(T) only for T £ £(V n ) with the property 
deg(T/) = deg(/) for every / G V n . For such T we define 

K F (T) := sup{d F (Z(f),Z(Tf)) : / G V n \V }. 

Proposition 8.2. Let T,T% G jC(V n ) and let a be a non-zero complex num- 
ber. Then 

(a) K m (T) < K h (T) < K H (T). 

(b) Kh{TiT) < Kh(T) + Kh{T\). 

(c) Kh^T) < K H (T) + K H {Ti). 

(d) For invertible T, K H (T) = max{K h (T), ^(T" 1 )} = K H (T- 1 ). 
If deg(/) = deg(T/) = deg(T 1 /) for all f G V n , then 

(e) K F (T X T) < K F {T) + K F (T X ). 

(f) K H (T) < K F (T) . 

(g) T is invertible and K F {T) = K F (T- 1 ). 
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Proof. The proofs follow directly from the definitions, properties of the un- 
derlying distances and properties of the supremum. A proof of (JbJ) follows: 

Khi^T) = m V {d h {Z(f), Z{T x Tf)) :/eP„) 

<sup{d h (Z(f),Z(Tf)) : f eV n } 

+ S up{d h {Z(Tf), Z(TiT/)) :feV n } 

< K h (T) + K^Ti). 

To prove (J3J) assume that T is invertible. Then T~ 1 V n = V n and therefore 
KhfT- 1 ) = sup{4(Z(/), Z{T- l f)) :f£V n } 
= sup{d h (Z(Tg),Z(g)) :g£V n }. 

Now the first equality in (jHJ follows from the definition of Kh(T). The 
second equality follows from the first when T is substituted by T _1 . 

The remaining statements are proved similarly. □ 

9. Exact Calculations 
We start with general results for and Kh- 
Theorem 9.1. Let T be an invertible operator in D(Vn)- Then 

K h {T) = e pz> n ] = d ft (z(^),z(r0 n )). 

Proof. By Proposition 17. 11 

d h (Z(f), Z(Tf)) < Q [T<l> n ], f e v n \v . 

Since T maps constants onto constants, it follows that Kh(T) < g[T(J) n ]. 
Clearly, d h (Z((p n ), Z(T(j) n )) = g[T<f> n ], and therefore, K h (T) > g[T(p n ]. □ 

Proposition !8.2l|Hf) now yields: 
Corollary 9.2. Let T be an invertible operator in Z?("P n ). Then 
K H (T) = max{ g[T '^ n ], £>[T _1 0n]} 

= m a x{d H [Z{(f> n ), Z{T(j> n )) , d H (Z(T- l (f> n ), Z(<f> n )) } . 

Remark 9.3. Theorem 19.11 conveys that the worst possible perturbation of 
roots measured by the distances dh occurs at the polynomial (p n . That is, 
for all feV n , 

d h {Z(f),Z(Tf)) < d h {Z{<j> n ),Z{T^ n )). 

For the distance dn, by Corollary 19.21 the worst possible perturbation of 
roots occurs either at n or at T~ l (f> n . That is, for all / E V n , 

dn (Z (f) , Z (T /)) < max {d H (Z(0 n ), Z{T<t> n )) , d H {Z(T~ Vn), Z(<t> n )) } . 

It would be interesting to know whether the last two inequalities must be 
strict when / is not a multiple of <f> n . 
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Exact calculations are possible only for simple operators in T>(V n )- We 
study two such classes. Let 0,7 G C. As before, S(a) G V(T n ) is the 
operator that shifts the independent variable by a defined in (fl.lj) . Further, 
we define 

H k {l) :=I-lD k , k=l,...,n. 

Proposition 9.4. Let a G C and consider S(a) G ViVn). Then 

K m (S(a)) = K h (S(a)) = K H (S(a)) = K F (S(a)) = \a\. 

Proof. For / G V n we have Z(S(a)f) = {—a} + Z{f). Therefore, 

d m (Z(f),Z(S(a)f)) < \a\ and d m (Z((j) n ), Z (S (a) (f> n )) = \a\. 

Hence K m (S(a)) = \a\. The same argument can be used for Kp. 
Since w[S(a)) = S(a)4> n , by Theorem 19. II we have 

K h (S(a)) = Q[S(a)(j) n ] = \a\. 

By Proposition OO, 

K H (S(a)) = max{K h (S(a)),K h (S{-a))} = \a\. □ 

Proposition 9.5. Let 7 G C and consider Hf-fa) G T>(V n ) for k = 1, . . . , n. 

Then 



1 jjjw! 
(n — k) 



Proof. By the definition of w, ^(.£^(7)) = H^^)^ and 
{H k ^i)4> n )(z) = (j) n (z) - 7 - ' 1 J< 



n\ \ (n — k)\ 



Consequently, g^H^)^^ = ypyl n!/(n — &)!, and the proposition follows 
from Theorem 19. II □ 

10. Estimates 

Let ao, • • • , a n G C, ao 7^ 0, and let T = T(ao, • • • , a n ) be the correspond- 
ing invertible operator in 2?('P n ). In this section we give estimates for the 
quantities Kh(T), Kh{T) and Kp(T) in terms of the coefficients ao, . . . ,a n . 

Proposition 10.1. Let 7 G C. Then 

(10.1) (n!) 1/n | 7 | ^^(^(T)" 1 ) <n| 7 |- 

Proof. To prove the first inequality in 1)10, 1|) consider the polynomial 

g = 7"- 1 0i + 7 n ~ 2 2 + • • • + 7 K-\ + <t>n- 

A straightforward computation gives H\{^)g = g — 7 g' = (j) n — 7™. Hence, g 
has a root at z = 0, while Hi(-y)g has all of its roots on the circle of radius 
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(n!) 1 /" - ^). If we put / = H\(-y)g, then g = H\('y)~ 1 f and the previous 
observation about the location of the roots implies that 

d A (z(/) > z(fr 1 ( 7 )- 1 /))>(n0 1/n l7|. 

This implies the first inequality in (|10. 1|) . 

To prove the second inequality notice that by (|3.5|) . 

n n 
fc=0 fc=0 

The roots of Ylk=o 't'k have been researched extensively; see for example 
jllj and the references therein. Here we only need that the root radius 
of the polynomial Ylk=o is smaller than or equal to n. Consequently, 
g^Hi('-f)^ 1 n ] < n I7I and the second inequality in (jlU.lj) follows from The- 
orem 19.11 □ 

Corollary 10.2. Let 7 G C. Then 

K H {H x in)) = KH^iT 1 ) = n| 7 |. 

Proof. The corollary follows from Propositions l8.2l(| d |) . l9.5l and llU.ll □ 

Proposition 10.3. Let 7 G C. Then 

K F [H x {<i)) <n 2 | 7 |. 

Proof. Let / be a non-constant polynomial in V n and 7 G C. Consider the 
set 

n . u D (» + ^,=M). 

wez(f) v 7 
By |12l Corollary 5.4.1(iii)] we have Z(H\(^)f) C f2, and in each connected 
component Q x , ■ ■ ■ > ^fc of O the polynomials / and H\{p/)f have the same 
number of zeros, counted according to their multiplicities. Therefore 

d F {Z{f),Z(H x {rt)f)) <max{diam(%) : j = 1, . . . , k} . 
Since for each j = 1, . . . , k, 

diam(O i ) < 2n T ^ = n 2 | 7 |, 

we conclude that K F (H\(^)) < n 2 \^y\. □ 

Theorem 10.4. Let ao, . . . , a n G C, oq 7^ 0. Let T = T(glq, . . . , a n ) be the 

corresponding invertible operator in D^n). Let 71, . . . ,j n be the roots of 

a z n + ai z n ~ l -\ h a n _i z + a n 

counted according to their multiplicities. Then 

(10.2) K h (T) < n (| 7 i| + --- + |7 ft |), 

(10.3) Kh{T) < n(\ji\ H + |7„|) , 

(10.4) K F (T) <n 2 (| 7 i| + --- + |7n|). 
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Proof. Lemma lfi.ll implies that deg(Tf) = deg(/) for all / £ V n . Thus 
Kf(T) is defined and T('P n \'Po) = V n \VQ. By repeated application of 
statements (JEJ),©, and (Q of Proposition 18.21 to ()3.6|) we get 

K{T) < if (#1(71)) + • • • + tf(#i(7n)), 



where if can be either if^, if#, or Kp. Now t|10.2j) follows from Proposi- 
tion (|10-3|) follows from Corollary 110.21 and (|10.4j) follows from Propo- 
sition inni □ 

Remark 10.5. The problem of estimating the sum of the absolute values of all 
the roots of a given polynomial was studied by Berwald, see [El Theorem 2.3]. 

Proposition 10.6. Let T be an invertible operator in V(V n ). Then 

(10.5) K H {T) < K F (T) < (en 3 lnn)K H (T). 

Proof. The first inequality was proved in Proposition 18.21 Let us now fix 
T £ V(V n ) and assume that 

T = I + ^D + -.. + 7 ^-D n - l + ^D n . 
1! (n — 1)1 n\ 

By the known estimate j!2l (8.1.12)], all the roots jj of the polynomial 

Q Z +aiz' L H h t -vr Z H - 

(n — lj! n! 

are smaller than Xw(l a il/iO - By Q10-4|) we then have 

n /i | \ 1 / j 

(10.6) ^(r)<n 3 ^(^) . 

On the other hand, by Theorem 19.11 we have 

(10.7) K h {T) = d h [Z{<t> n ), Z(T(<j> n ))) = e [T<t> n }. 
A lower estimate for g[T(p n ], where 

(T0 n ) (z) = ^ + Wi* + ■ ■ ■ + ( n " x ) "i*"" 1 + z ' 
is given by another classical inequality (see ^2j (8.1.1)]): 

(10.8) max \aA 1/j < g[T(j> n ]. 

l<j<n 

Combining (fTTH)|) . (tnUH) and (tTtTTl) and letting // = Ej=i(j ! )~ 1/j g ives 
K F (T) < n 3 fi max |a,| 1/j ' < n 3 fj,K h (T) < n Z fiK H (T). 

l<j<n 

To estimate fi, note that for j > 1 we have 

1 \ ' e e , 7 

< - < r < e In ■ ' 
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where the first inequality easily follows from Stirling's approximation [HI 
p. 183], while the last one is a special case of 3.6.17]. Since I+775 < eln2, 
adding up gives 

t j - = it(^) <elnn 

j= i 

and thus, finally, Kp(T) < (en 3 Inn) Kh(T) as we needed to prove. □ 

The factor en 3 lnn in the second inequality of (|10.5j) is plausibly far from 
being best possible, but we have not pursued this line of enquiry. 

11. The main theorem 

We can now finally give a comprehensive and compact answer to the 
question posed in the Introduction. 

Theorem 11.1. Let T G C(V n ), T ^ 0. The following statements are 
equivalent. 

(a) T G T>(V n ) and T is invertible. 

(b) T is a Grace operator. 

(c) K h {T) < +00. 

(d) K m {T) < +00. 

(e) T is invertible and TD = DT. 

(f) K H {T) < +00. 

(g) deg(Tf) = deg(/) for all f G V n and K F (T) < +00. 

Proof. The statements (jsj through (jsj) are equivalent by Theorem 17.31 

The implication jgj=^(jgj) follows from Proposition ^, 41 equalities Q3.4|) and 
Theorem 111). 41 Proposition I8.2l (|f|) yields @=^(0) and (p=^jcj) follows from 
Proposition 18 . 2] (jgj) • The theorem is proved. □ 

As V(V n ) is a commutative subalgebra of C(V n ) we have the following 
corollary. 

Corollary 11.2. If T, T\ G £("P n )\{0} satisfy any of the equivalent condi- 
tions (jsji-fej) in Theorem Ml.ll then T and T\ commute. 

Remark 11.3. By Proposition l3.4l the statement in Theorem lll.ll is equiv- 
alent to T -1 is invertible and T _1 G T>(V n ). Therefore, the statements about 
the operator T in Theorem 111.11 are equivalent to the corresponding state- 
ments about the operator T _1 . 

12. Examples 

In the next two examples we consider the impact of the operators from 
T^iVn) on the roots of polynomials in V n for n = 1,2, where the compu- 
tations can be carried out in all detail. While the situation for n = 1 is 
as trivial as expected, the case n = 2 gives a good insight into why direct 
calculations for larger matrices are bound to be unwieldy. 
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Example 12.1. InVi we have 

T(a , ai)(0i - w(f) ) = a (f>i - (a w - ai)<p . 

Provided that ao 7^ 0, the polynomial T(ao, ai)(<fti — iV(j>o) has a root at 
w — ai/ao- Thus, the operator T(ao,«i) shifts all the roots by exactly 
ai/ao. This also follows from T(ao,ai) = aoS(ax/ao); see (jl.lj) . 

Example 12.2. Let w\, W2 be arbitrary complex numbers. For the operator 
T(ao, cii, (12) G T^iVz) with ao / we have 



T(a , a\, a 2 ) ((</>i - wx4>o){<j>x - u>20o))) ( z 
ai u; 1 + u>2 



a z z + 2 — 
a 



z + ^1^2 1 + u>2j H 

ao ao 



and so the roots of T(ao, ax, 02) {{4>i ~ wx4>o)(<px — ^2^0)) are 



(12.1) 



Wx + W2 ax 
2 a 



Wx — W2 



+ 



02 



OlX 

a J a 



In this case we see that, as predicted by our main result, both of the new 
roots are uniformly close to the original wx,W2- Also, we see that if all 
the constants are real and a\ > aoa2 then an invertible operator in T>{T , 2) 
"sends real roots into real roots". To check the statement on the uniform 
displacement of the roots, do as follows: 

Call the roots in (|12.1|) z\ (with +) and Z2 (with — ). Since we are dealing 
with complex numbers, let us think of the square root as having one definite 
value (then the ± takes care of the second root). Also, to simplify the 
algebra, define 



Sx :-- 



ax 
«o 



and 



82 := 



So, for instance, we have 



\zx - w\\ < |^i I + 



wx — W2 



+ 5 2 



02 
a 



Wx — W2 



and similarly we can estimate Z2 — wx- Now, we have 

Wx - w 2 



wx — W2 



+ 5 2 




Wx — W2 



Wx-W2 

+ 02 H = o 2 



and so we see that at least one of the factors on the left hand side has 
modulus less than or equal to ^/^l- By what noted above, this means that 



mm 



{\zx - wx\, \z 2 - Wx\} < \Sx\ + 
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which (together with the similar estimates derived with W2 instead of w\) 
translates into the statement that the roots of 

T(ao,ai,a 2 )((<t>i - wi(f>o)((f>i - ^2^0)) 

are not farther away from w\ and W2 than the uniform quantity 



ai 


+ \ 




( ai \ 2 a 2 


ao 




\a J a 



Example 12.3. This example shows that Kh(T) < Kp(T). It was found 
with the help of Mathematica. We consider V3 and the operator T and its 
inverse given by 

2 2 n 4 o , 2 2 9 4 , 

T = I + -D + -D 2 D 3 , T = I D + - D -\ D . 

3 9 27 3 9 27 

Set f(z) = (z - l) 2 (z + 1) and g(z) = (z + l) 2 (z - 1). Then Tf = g and 
dp(Z(f), Z(g)) = 2. Mathematica calculated that 

(12.2) g[Tfo] = q[T~ Vs] = I (1 + ^2) « 1.506614. 

Therefore 

^h(T) = ^(1 + ^2) <2<K f (T) 

Since both dp(Z((ps), Z(T<ps)) and dp[^Z{cj)^), Z(T _1 03)) are equal to the 
number in (|12.2|) . this example also shows Kp(T) is not related to ^3 
in the sense of Corollary 19.21 We leave it as an open problem to ex- 
plore the existence and uniqueness of a polynomial / such that Kp{T) = 
d F {Z(f),Z(Tf)). 

Example 12.4. Several easy examples of "bad behavior" from an operator 
T which does not satisfy condition (jgj) in Theorem 17.31 can be obtained as 
follows. 

Define T by T(f) := / + /(O)0 n for / G V n . Then T is clearly linear, 
invertible and and does not commute with D. Thus, by Proposition 13.21 
T V(V n ). If w G C \{0, 1} is arbitrary, then Z(T((j) n — w4>q)) consists 
of all the n-th roots of n\w/(l — w), while all the roots of 4> n — W(j>o have 
modulus y/\w\ , thus ensuring that the distance (with respect to each of the 
distances introduced in Section |HJ) between the two multisets of roots can be 
as large as desired (just let w approach 1). 

The operator D belongs to 2?("P n ) but it is not invertible. With the same 
polynomials as above we have Z(D((j) n — w4>q)) = {0, ... ,0} (n — 1 zeros) 
again showing that the distance between Z(D(<fi n — w4>o)) an d Z(<p n — wc^q) 
(with respect to each of the distances, except dp which is not defined) can 
be as large as we wish. 

The operator defined by T(f) = / + (a n -i/n)cj>o for f(z) = ao + a\z + 
• • - + a n z n , is clearly linear, invertible and its matrix with respect to the basis 
{4>o, . . . , 4> n } of V n is upper triangular. This operator does not belong T>(V n ) 
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since it does not commute with D. Polynomials that show that none of (|dj) . 
(jjjjl, (JfJ) and (jgj) of Theorem 111.11 hold are (0i — w(j)o) n , «i£C. Namely, the 
roots of — w(f>o) n ) are on the circle centered at w with radius 

and Z((<fii — w4>o) n ) = {w, . . . , w} (n times). 

Acknowledgment The authors thank the referees for pointing out several 
related references, the article |13) in particular. This resulted in numerous 
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